Generalized Zero-Shot Learning for Action Recognition with Web-Scale Video Data

Authors

Kun Liu

Wu Liu

Huadong Ma

Wenbing Huang

Xiongxiong Dong

Abstract

Action recognition in surveillance video makes our lives safer by detecting criminal events or predicting violent emergencies. However, efficient action recognition is not free of difficulty. First, there are so many action classes in daily life that we cannot pre-define all possible action classes beforehand. Moreover, it is very hard to collect real-world videos for certain actions, such as stealing and street fighting, due to legal restrictions and privacy protection. These challenges make existing data-driven recognition methods insufficient to attain the desired performance. Zero-shot learning has the potential to solve these issues, since it can perform classification without positive examples. Nevertheless, current zero-shot learning algorithms have been studied under the unreasonable setting where seen classes are absent during the testing phase. Motivated by this, we study the task of action recognition in surveillance video under a more realistic generalized zero-shot setting, where testing data contains both seen and unseen classes. To the best of our knowledge, this is the first work to study video action recognition under the generalized zero-shot setting. We first perform extensive empirical studies of several existing zero-shot learning approaches under this new setting on web-scale video data. Our experimental results demonstrate that, under the generalized setting, typical zero-shot learning methods are no longer effective for the dataset we used. Then, we propose a method for action recognition by deploying generalized zero-shot learning, which transfers the knowledge of web video to detect anomalous actions in surveillance videos. To verify...

Kun Liu · Wu Liu · Huadong Ma · Xiongxiong Dong: Beijing University of Posts and Telecommunications, Beijing, China. Tel.: +086-62-282767. Fax: +086-62-282767. E-mail: {liu kun, liuwu, mhd, dong-bupt}@bupt.edu.cn

Wenbing Huang: Tencent AI Lab, Beijing, China. E-mail: [email protected]

arXiv:1710.07455v1 [cs.CV] 20 Oct 2017
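The difference between the conventional and the generalized zero-shot settings described in the abstract can be illustrated with a small sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the two-dimensional embeddings, the class prototypes, the nearest-prototype classifier, and the harmonic mean used as a summary score. It only shows how restricting the label space to unseen classes at test time can hide errors that appear once seen classes are also candidates.

```python
from math import dist  # Euclidean distance, Python 3.8+

def predict(embedding, prototypes, candidate_classes):
    """Return the candidate class whose prototype is nearest to the embedding."""
    return min(candidate_classes, key=lambda c: dist(embedding, prototypes[c]))

def accuracy(samples, prototypes, candidate_classes, target_classes):
    """Accuracy over the samples whose true label belongs to target_classes."""
    subset = [(x, y) for x, y in samples if y in target_classes]
    correct = sum(predict(x, prototypes, candidate_classes) == y for x, y in subset)
    return correct / len(subset)

def harmonic_mean(a, b):
    """Summary score that is high only if both accuracies are high."""
    return 0.0 if a + b == 0 else 2 * a * b / (a + b)

# Hypothetical class prototypes in a toy 2-D semantic space.
prototypes = {
    "walk": (0.0, 0.0),
    "run": (1.0, 0.0),
    "fight": (0.0, 1.0),
    "steal": (1.0, 1.0),
}
seen = {"walk", "run"}        # classes with training videos
unseen = {"fight", "steal"}   # classes without any training video
all_classes = seen | unseen

# Hypothetical test embeddings; the "fight" sample drifts toward "walk",
# mimicking a model biased toward the classes it was trained on.
test_samples = [
    ((0.1, 0.1), "walk"),
    ((0.9, 0.1), "run"),
    ((0.2, 0.4), "fight"),
    ((0.9, 0.8), "steal"),
]

# Conventional zero-shot: only unseen classes are candidates at test time.
zsl_unseen_acc = accuracy(test_samples, prototypes, unseen, unseen)

# Generalized zero-shot: seen and unseen classes are all candidates.
gzsl_seen_acc = accuracy(test_samples, prototypes, all_classes, seen)
gzsl_unseen_acc = accuracy(test_samples, prototypes, all_classes, unseen)
gzsl_h = harmonic_mean(gzsl_seen_acc, gzsl_unseen_acc)

print(zsl_unseen_acc, gzsl_seen_acc, gzsl_unseen_acc, gzsl_h)
```

In this toy case the "fight" sample is classified correctly when only unseen classes compete, but is absorbed by the seen class "walk" once all classes are candidates, so the unseen-class accuracy drops in the generalized setting.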

Similar resources

Multi-Label Zero-Shot Human Action Recognition via Joint Latent Embedding

Human action recognition refers to automatically recognizing human actions from a video clip, which is one of the most challenging tasks in computer vision. Because annotating video data is laborious and time-consuming, most existing works in human action recognition are limited to a number of small-scale benchmark datasets where there are a small number of video clips associated...

Alternative Semantic Representations for Zero-Shot Human Action Recognition

A proper semantic representation for encoding side information is key to the success of zero-shot learning. In this paper, we explore two alternative semantic representations especially for zero-shot human action recognition: textual descriptions of human actions and deep features extracted from still images relevant to human actions. Such side information is accessible on the Web at little cost...

Exploring Semantic Inter-Class Relationships (SIR) for Zero-Shot Action Recognition

Automatically recognizing a large number of action categories from videos is of significant importance for video understanding. Most existing works have focused on designing more discriminative feature representations and have achieved promising results when enough positive samples are available. However, very limited effort has been spent on recognizing a novel action without any positive exemplars, whi...

Action Change Detection in Video Based on HOG

Background and Objectives: Action recognition, as the process of labeling an unknown action in a query video, is a challenging problem due to event complexity, variations in imaging conditions, and intra- and inter-individual action variability. A number of solutions have been proposed to solve the action recognition problem. Many of these frameworks suppose that each video sequence includes only one ...

A Generative Approach to Zero-Shot and Few-Shot Action Recognition

We present a generative framework for zero-shot action recognition where some of the possible action classes do not occur in the training data. Our approach is based on modeling each action class using a probability distribution whose parameters are functions of the attribute vector representing that action class. In particular, we assume that the distribution parameters for any action class in...

Journal title:

CoRR

Volume abs/1710.07455

Pages -

Publication date 2017
